Efficient Implementation of Joins over Cassandra DBs

نویسندگان

  • Haridimos Kondylakis
  • Antonis Fountouris
  • Dimitris Plexousakis
چکیده

Over the last few years we witness an explosion on the development of data management solutions for big data applications. To this direction NoSQL databases provide new opportunities by enabling elastic scaling, fault tolerance, high availability and schema flexibility. Despite these benefits, their limitations in the flexibility of query mechanisms impose a real barrier for any application that has not predetermined access use-cases. One of the main reasons for this bottleneck is that NoSQL databases do not support joins. In this poster we present a solution that efficiently supports joins over such databases. More specifically, we present a query optimization and execution module placed on top of Cassandra clusters that is able to efficiently combine information stored in different columnfamilies. Our preliminary evaluation demonstrates the feasibility of our solution and the advantages gained when compared to a recent commercial solution by DataStax. To the best of our knowledge our approach is the first and the only available open source solution allowing joins over NoSQL Cassandra databases.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficient Skew Handling for Outer Joins in a Cloud Computing Environment

Outer joins are ubiquitous in many workloads and Big Data systems. The question of how to best execute outer joins in large parallel systems is particularly challenging, as real world datasets are characterized by data skew leading to performance issues. Although skew handling techniques have been extensively studied for inner joins, there is little published work solving the corresponding prob...

متن کامل

Efficient implementation of low time complexity and pipelined bit-parallel polynomial basis multiplier over binary finite fields

This paper presents two efficient implementations of fast and pipelined bit-parallel polynomial basis multipliers over GF (2m) by irreducible pentanomials and trinomials. The architecture of the first multiplier is based on a parallel and independent computation of powers of the polynomial variable. In the second structure only even powers of the polynomial variable are used. The par...

متن کامل

Measurement of in vivo therapeutic mAb concentrations: comparison of conventional serum/plasma collection and analysis to dried blood spot sampling.

BACKGROUND Development of an alternative sampling method that uses small amounts of whole blood, such as dried blood spots (DBS), would be an advance in the quantitative assay field. Previously, we assessed the ability to quantitate therapeutic monoclonal antibodies present in DBS compared with a typical serum sample-based method, and concluded that measurements in DBS were reproducible and yie...

متن کامل

The Joins Concurrency Library

Cω extended C 1.x with a simple, declarative and powerful model of concurrency join patterns applicable both to multithreaded applications and to the orchestration of asynchronous, event-based distributed applications. With Generics available in C 2.0, we can now provide join patterns as a library rather than a language feature. The Joins library extends its clients with an embedded, type-safe ...

متن کامل

The Design and Implementation of Seeded Trees: An Efficient Method for Spatial Joins

Existing methods for spatial joins require pre-existing spatial indices or other precomputation, but such approaches are inefficient and limited in generality. Operand data sets of spatial joins may not all have precomputed indices, particularly when they are dynamically generated by other selection or join operations. Also, existing spatial indices are mostly designed for spatial selections, a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016